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Introduction 

Methods to determine the structural similarity or distance 
between graphs have been applied in many areas of sciences. For 
example, in mathematics [1,2,3], in biology [4,5,6], in chemistry 
[7,8] and in chemoinformatics [9]. Other application-oriented 
areas where graph comparison techniques have been employed 
can be found in [10, 11, 12]. Note that the terms 'graph similarity' 
or 'graph distance' are not unique and strongly depend on the 
underlying concept. The two main concepts which have been 
explored extensively are exact and inexact graph matching, see 
[13,3]. Exact graph matching [2,3] relates to match graphs based 
on isomorphic relations. An important example is the so-called 
Zelinka distance [3] which requires computing the maximum 
common subgraphs of two graphs with the same number of 
vertices. However, it is evident that this technique is computa- 
tionally demanding as the subgraph graph isomorphism problem 
is NP-complete [14]. In contrast to this, inexact or approximative 
techniques for comparing graphs match graphs in an error- 
tolerant way, see [13]. A highlight of this development has been 
the well-known graph edit distance (GED) due to Bunke [15]. 
String-based techniques also fit into the scheme of approximative 
graph comparison techniques [1,16]. This approach aims to derive 
string representations which capture structural information of the 
underlying networks. By using string alignment techniques, one is 
able to compute similarity scores of the derived strings instead of 
matching the graphs by using classical techniques. Concrete 
examples thereof can be found in [1,16]. 

As mentioned, numerous graph similarity and distance mea- 
sures have been explored. But in fact, there is still a lack of a 
mathematical framework to explore interrelations of these 



measures. Suppose let di : C/x5^K+ and dj : Q >iQ^'^+ be 

two comparative graph measures (i.e., graph similarity or distance 

measures) which are defined on the graph class Q. Typical 

questions in this idea group would be to prove interrelations of the 

measures by means of inequalities such as d\ < dj. For instance, 

(>) 

inequalities involving graph complexity measures have been 
inferred by Dehmer et al. [17,18]. 

The main contribution of this paper is to infer interrelations of 
graph distance measures. To the best of our knowledge, this 
problem has not been tackled so far when using graph distance 
measures. However, interrelations of topological indices inter- 
preted as complexity measures have been studied, see 
[7,19,20,17,18]. For instance, Bonchev and his co-workers 
investigated interrelations of branching measures by means of 
inequalities [7,19,20]. Dehmer [17] examined relations between 
information-theoretic measures which are based on information 
functionals and between classical and parametric graph entropies 
[18]. We here put the emphasis on graph distance measures 
which are based on so-called topological indices. These measures 
themselves have not yet been studied. Note that we only consider 
distance measures (without loss of generality) as they can be 
easily transformed into graph similarity measures [21]. In order 
to define these measures concrete, we employ an existing 
distance measure (see Eq. (6)) and the well-known Randic index 
[22], the Wiener index [23], eigenvalue-based measures [24], 
and graph entropies [17,25]. Also, we discuss quality aspects of 
the measures and state conjectures evidenced by numerical 
results. 
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Methods and Results 

Topological Indices and Preliminaries 

In this section, we introduce the topological indices which are 
used in the paper. A topological index [23] is a graph invariant, 
defined by 



/: g- 



Simple invariants are for instance the number of vertices, the 
number of edges, vertex degrees, degree sequences, the matching 
number, the chromatic number and so forth, see [26] . 

We emphasize that topological indices are graph invariants 
which characterize its topology. They lia\-c l)cen used for 
examining quantitative structure-activity relationships (QSARs) 
extensively in which the biological activity or other properties of 
molecules are correlated with their chemical structures [27]. 
Topological graph measures have also been applied in ecology 
[28], biology [29] and in network physics [30,31]. Note that 
various properties of topological graph measures such as their 
uniqueness and correlation ability have been examined too 
[32,33]. 

Suppose G = ( V,E) is a connected graph. The distance between 
the vertices u and v of G is denoted by d(u,v). The Wiener index of 
G is denoted by W(G) and defined by 



WiG)= d{u,v). 



(2) 



The name Wiener index or Wiener number for the quantity 
defined is common in the chemical literature, since Wiener [34] in 
1947 seems was the first who considered it. For more results on the 
Wiener index of trees, we refer to [35] . 

In 1975, Randic [36] proposed the topological index R (R-\ 
and -'^-i) by using the name branching index or connectivity index, 
suitable for measuring the extent of branching of the carbon-atom 
skeleton of saturated hydrocarbons. Nowadays this index is also 
called the Randic index. In 1998, BoUobas and Erdos [37] 

generalized this index by replacing — - by any real number a, 

which is called the general Randic index. In fact, the Randic index 
and the general Randic index became the most popular and most 
frequently employed structure descriptors used in structural 
chemistry [38]. For a graph G = ( V,E), the Randic index R{G) 
of G has been defined as the sum of {d{u)d(v))~^^^ over all edges 
uv of G, i.e., 



^{d{u)d{v)y 



(3) 



For a given graph G with n vertices, k\,k2,---,^n are the 
eigenvalues of G. The energy of a graph G, denoted by S{G), has 
been defined by 



£{G)=Y,\U, 



(5) 



1=1 



due to Gutman in 1977 [40]. For more results on the graph 

energy, we refer to [41,24,42]. 

Novel Graph Distance Measures 

Now we define the distance measure [2 1] 



/ x—y \2 

d(x,y)=l-e ^ " ' , 



(6) 



which is a mapping d : IRxlR-»IR+. Obviously it holds 

d(x,x) = 0, d(x,y)>0, and d(x,y) = d(y,x). In order to translate 
this concept to graphs, we employ topological indices and obtain 



di(G,H) : = d(I(G),I(H)) = 1 - e" 



/ /(G)-/(g) \ 



(7) 



Further we infer a relation between the maximum value of dj 
and the extremal values of /. 

Observation 1. Let Q be a class of graphs. Suppose G,HeQ, then 
G,H are the two graphs attaining the maximum value of di if and only if 
G,H are the graphs attaining the maximum and minimum value of 7, 
respectively. 

Proof. Let x=\I(G) — I(H)\, then di is a monotone increasing 
function on x. Therefore, the maximum value of dj is attained if 
and only if the maximum value of \I{G) — I{H)\ is attained. □ 

From Observation 1 and some existing extremal results of 

topological indices, we obtain some sharp upper bounds of dj for 
some classes of graphs. As an example, we list some of those results 
for trees. 

Theorem 1. Let T and T be two trees with n vertices. Denote by Sn 
and P„ the star graph and path graph with n vertices, respectively. 

(i). The maximum value oi dw{T ,T') is attained when T and 
T' are S„ and P„, respectively. 

((7)- The maximum value of dii(T,T') is attained when T and 
T' are Sn and P„, respectively. 

(///). The maximum value of rfo^(r,7"') is attained when T and 
T' are Sn and P„, respectively. 

(iv). The maximum value ot d£{T,T') is attained when T and 
T' are Sn and P„, respectively. 



where d(u) is degree of a vertex u of G. The zeroth-order Randic 
index due to Kier and Hall [6] is 



1 



ueV(G) V"W 



(4) 



Interrelations of Graph Distance iVleasures 

Observe that 0<(//(G,//)< 1, which implies that 

0 < e '> " ' < 1 . Some trivial properties of d/ are as follows. 
Let 5 be a class of graphs and G,HeQ. We get 



d,{G,G) = di{H,H) = Q; 



(8) 



For more results on the Randic index and the zeroth-order 
Randic index, we refer to [39,22,38]. 



di{G,H)>di{G,G) = 0; 



(9) 
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d,{G,H) = d,{H,G). (10) 



we obtain 

{h{G)-h{H)f > <x\h{G)-l2{H)f. (18) 



_(/l(G)-/l(H))2 

1-e ? > 

7 9 . 9. (19) 

1— e =1— I g ff2 



However, <// is not a metric graph distance measure, since the 
triangle inequality di{G,H) + di{H,K)>di{G,K) for G,H,Keg, Thus 
does not hold generally. Actually, we obtain a modified version of 
the triangle inequality. 

Theorem 2. Let I be a topological index. Let Q be a class of graphs 
and G,H,Keg. If 

\I(G)-I(IC,\<\IiG)-IiH)\ or \m-IiK)\<\I(H)-I{K)\, 

(11) 

then we have dAG,H) + di{H,K) > di(G,K). '-e-. 
Proof We now suppose \I(G)-I(K)\ < \I(G)-I(H)\, since the 
proof of the other case is similar. (/l((?)-/l(/r))2 / (/2(G)-/2(//))2 \ 

From the inequality |/(G)-/(.«:)|<|/(G)-/(i/)|, we get j_g ^ >l_|l_(l_e ^ )) . (20) 

, l(G)-m 2 _ J(G)-I(H\ 2 
e > >e " ' . (12) 

Thus, 

2 

d,^{G,H)> 1 - {\-di^{G,H)y . (21) 



Since e " < 1, together with Eq. (12), we have 

MG)-HK) 2 AG)-I(H) 2 ( I(H)-I(K) \2 

l+e-( 5 ) >e-( 5 ) +e \ " ) . (13) 

Therefore, we have the following inequality, 
l-e-^'^^+l-e<^^f-il-e-^'^^')>0,iH) \im-im\<mG)-MH)\, (22) 



The proof is complete. □ 

Suppose /a is also a topological index. Then if 



le., di{G,H)+d,{H,K)>si{G,K). □ 

We emphasize if the Inequalities 1 1 are satisfied, the modified 
triangle inequality holds. In practice, the triangle inequality may 
not be absolutely necessary' (e.g., for clustering and classification 
problems) and is often required to prove properties of the 
measures. 

Theorem 3. Let I\ and I2 be two topological indices. Let Q be a class 
of graphs and G,HeQ. If 



\h{G)-h{H)\>a\l2{G)-l2{H)\, (15) 

then 

\-di^{G,H)<{\-d,^{G,H)r\ (16) 

where oi>0 is a constant. 
Proof. Since 



|/i(G)-/i(//)|>a|/2(G)-/2(/^)|, (17) 



we derive similarly 

1 - di^ iG,H) > (1 - *3 (G,H)f , (23) 



where J?>0 is a constant. Therefore, we obtain the following 
theorem. 

Theorem 4. Let I\ , I2 and 7$ be three topological indices. Let Q be a 
class of graphs and G,HeQ. If 



P\h(G)-h(H)\ > \h{G)-h{H)\>a\l2{G)-l2{H)l (24) 

then we infer 

( 1 - 0^/3 {G,H)f <\-d,^ {G,H) <{l-d,^ iG,H)f , (25) 



where (x.,p>0 are constants. 

Theorem 5. Let I\ and I2 be two topological indices. Let G be a class 
of graphs and G,HeQ. If 
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we have 



|7i((?)-7i(/r)|>|/2(G)-72(/r)|+a, (26) 



2 -2HJ -ln[\-,l, iG.H) 



thm we get 1 - c//j {G,H) >e e ^ (1 _ d,^ {G,H)), (37) 



\-d,^(G,H)<e o^e {\-d,^(G,H)), (27) 



^ -2aW - ln^l-rf/^(G,7/)j where /?>0 is a constant. Therefore, we obtain the following 

^ (\-^,tnTT\\ ni-\ theorem. 

Theorem 6. Let Ii I2 and be three topological indices. Let Q be a 
class of graphs and G,HeQ. If 



where a>0 is a constant. |/3((5)_/3(^)| +j8> > \l2(G)-l2(H)\+a, (38) 

Proof. Since 

then we have 



Ifl -211 J -dj^(G.H)^ 

l-rf/j(G,i/)>e"^e ' il-di^(G,H)) (39) 



and 



\Ii(G)-Ii(H)\ > \I2(G)-I2(H)\ +a, (28) 

we infer 

(7i(G)-7i(77))'>(/2((?)-/2(/7))' + 2a|/2(G)-/2(/r)|+a2. (29) 
And therefore, 

(/l(G)-/l(J/))2 (/2(G)-/2(fl))2+2aJ/2(g)-/2(/0| + a2 \-di^{G,H)<e o"^ e ^ {\ - di^(G,IT)), (40) 

1-e >\-e ? (30) 

2 where a, ;6 > 0 are constants. 



_„2 2a|/2(G)-/2(g)| (/2(G)-/2(g)/ 

= 1— e ?e ? e ? (31) 



Theorem 7. Let Ii , /2 ana? /a ZAree topological indices. Let G be a 
class of graphs and G,HeQ. If 



2a|/2(G)-/2(/0| / _t{2(G)-^{H))f \ 

|7i(G)-7i(/r)|>|72(G)-/2(//)||/3(G)-/3(/r)|, (41) 

:l-e ^2 l-(l-e ^2 ) . (32) 



Hence, 



_oP- -2a|/2(G)-/2(g)| 

\-di^{G,H)<e {\-di^(G,H)). (33) 



i«e infer 

hi (1 - (G,70) < - (7^ In (1 - di^{G,H))- In (1 - di^{G,H)). (42) 



From the definition of (//j, i.e.. 



Proo/^ Since 

|7i(G)-7i(/7)|>|72(G)-/2(/7)|-|/3(G)-/3(7r)|, (43) 

/ /2(C)-/2(J/A ^ 

(//2(G,77)=l-e V " y , (34) we derive 

we obtain that (7i(G)-7i(77))2 > |72(G)-72(77)|2 |73(G)-73(77)|2. (44) 



|72(G) - 72(77)1 = (7^- ln(l -d,^{G,H))- (35) 



And therefore, 



Finally, by substituting (35) into (33), we get the desired result. (/i(g)-/i(/0)2 (12(G)- hmHhV^-hW)^ 



□ 1— e >1— e ^ 

(/3(G)-/3(/0> 

|7i(G)-7i(77)|<|73(G)-73(7r)|+)S, (36) ^ ^ " 



Suppose 73 is also a topological index. Then if ,j Qj^-p. (45) 

/ (/3(G) — /3(//)) \ 2 

>1- e 
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i.e, di,iG,H)>l-(l-di^iG,ir)f'^^-''^'^^\ Hence we obtain 
1 -(//j(G,//)<(l -di^iG,H)f2(G)-hm)^^ (46) 



2ciC2|/2(G)-/2(//)||/3(G)-/3(^)| 



(55) 



which implies that 



In (1 - di^ {G,H)) < iUG) - h{H))- In (1 - di^{G,H))- (47) 



By substituting (35) into (47), we easily obtain the assertion of 
the theorem. □ 

By performing a similar proof as in Theorem 7, we obtain a 
more general result. 

Theorem 8. Let /, Ii h^h, ■ ■ . , Ik be topohgical indues. Let Q be 

a class of graphs and G,HeQ. If 



\i{G)-im> n ijiG)-ijm, 



(48) 



which implies 



di^(G,H)=l -(l-di^iG,H)fi(l-di^iG,H)f2 

i> it2 



By applying the substitutions 

\h{G) - h(H)\ = a^-\n{\-d,2{G,H)) 



and 



\h(G) - h{H)\ = cj^-\n(\-d,^{G,H)) 



(56) 



(57) 



(58) 



we infer 



\n{\-di{G,H))<{-\f-'^(j^-^ n \n{\-di.{G,H)). (49) 

j=i 1 



Theorem 9. Let I\ , and I-^ be three topoloffcal indices. Let G be a 
class of graphs and G,HeQ. If 



into (56), we obtain the final result. □ 

By performing a similar proof as in Theorem 9, we obtain a 
more general result again. 

Theorem 10. Let /, Ii h^h, ■ ■ ■ , Ik be topological indices. Let Q 
be a class of graphs and G,HeQ. If 



\I{G)-I{H)\ = cj\Ij(G)-Ij(H)\, (59) 

7=1 



\h{G)-hm=cAh{G)-hm+C2\h(G)-h(H)\, (50) ^^^^ >o fir 1 <j<k, then we infer 



where ci,C2 >0, then we get 



di^{G,H)=\-i\-d,^(G,H)f 



(51) 



di{G,H)=l- n {\-di.{G,H)fj 
j=i J 



j-j ^ - Icaci, y/ln (1 - (G.H)) In (1 - rf/j (G,H)) 



(60) 



Proof Since 

|/i(G)-7i(//)|=ci|72(G)-/2(//)|+C2|/3(G)-/3(/?)|, (52) 

we derive 

\IiiG)-IiiH)f = ciMG)-l2(Hf + cl\h(G)-h(Hf 

+ lcxC2\l2{G)-l2{H)\\h{G)-h{H)\. (53) 

Therefore, 



(/i(G)-/i(g))^ 

c^|/2(G)-/2(-g)P+4l^3(gW3WP_+2q'^2l^2(g)--f2WII-f3(G)-^3WI 

1-e 



(54) 



Graph Distance Measures Based on Randic Index 

In this section, we consider the values of the graph distance 
measure based on the Randic index and other topological indices 
for some classes of graphs. Denote by W and R the Wiener index 
and Randic index, respectively. 

Theorem 11. Let Q be a class of regular graphs with n vertices and I 
is an arbitrary topolo^cal index. For two graphs G,IIeG, we infer 



di{G,H)>dR{G,H) = Q. 



(61) 



Proof. Let G and H be two regular graphs of order n. By the 

n 

definition of the Randic index, we obtain that R{G) = R{II) = - , 

which implies that R(G) — R(II) = 0. Therefore, we infer 
d]i{G,H) = 0. Since dj{G,H)>0 for any topological index, then 
we obtain the desired inequality. 
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By using the definition of the zeroth-order Randic index for two 
graphs with the same degree sequences, we obtain that 
''R(G) = °R(H). Therefi )re, we get the following theorem. 

Theorem 12. Let Q he a class of graphs with the same degree 
sequences and I is an arbitrary topological index. Then for two graphs 
G,HeQ, we infer 



diiG,H)>do^{G,H) = 0. 



(62) 



For a given graph G of order n, we get \/n — l < R{G) < - (see 



[39]). Thus, 



ii?(G)-TOi<^-v;^. 



(63) 



From (63), we infer an upper bound for dj((G,H). 
Theorem 13. Let G and H be two connected graphs of order n. Then 
we get 



dH{x,y) - dGix,y) = (a+l)-b = a-b+l. (68) 



Observe that 



b-l 



i=l 7=1 

A-1 a 

dH{yb,u) + ^fi?//(vi)+ dnjuj). 

i=i j=\ 

Therefore, we have 

W{H) -W{G)= {dHix,y) - dG(x,y)) = 
{n-a-b-\){a-b+V)>Q, 



(69) 



(70) 



_rr+4n — A—An^/n—\ 

dR{G,H)<\-e 4? . 



(64) 



\W{G)-W{H)\={n-a-b-\){a-b+l). (71) 



The equality holds if and only if G and G' are S„ and a regular 
graph, respectively. 

A path P = xqX\X2 ■ ■ ■ Xk is pendent if t/(xo)>3, d{xk) = l and 
d{Xi) = 2 for aU 1 < i < A:. Especially, a vertex v is pendent if i/(v) = 1 . 
Suppose u and v are two pendent vertices, and u' the unique 
neighbor of M. We define an operation as follows: deleting the edge 
ujJ and adding the edge uv. We call this operation "transfer u to 
v". 

Theorem 14. Lxt G = (V,E) he a graph with n vertices. Denote by 

P\ and P2 the two pendent paths attaching to the same vertex such that 
I ^"1 1 > I -P2 1 ^ 1 • Denote by H the graph obtained by transferring the j 
vertex of P2 to the pendent vertex of Pi . Then we have 



For Z)>3, it is easy to verify R{G) = R{H). Therefore 
I W{G)- W(H)\ > \R(G)-R(H)\ holds. 

For h = 2, from (66), we have 2<a<n — 4 and 
\W{G)— W{LI) |=(« — a — 3)(a— 1). By performing some elemen- 
tary calculations, we get 



{n — a—3){a—l)> ^ —Vn—l, 



1 3« / 

flr-(«-2)a+y-Vn-l-3<0 



dw(G,H)>dR(G,H). 



(65) 



(72) 



(73) 



Proof. Let G = {V,E) be a graph with n vertices. Suppose 
P\=uu\U2 ■ . .Ua and P2 = uv\V2 ■ ■ - Vt, with a>h>\. Since Pi 
and P2 are two pendent paths attaching to the same vertex, then 
we get 



for 2<a<n — A and each value of «. Therefore, from (63), we infer 
I W(G) - W{H)\ > \RiG) - R{H)\. 

For h=l, from (66), we have l<a<w — 3 and 
\W(G)— W{LL)\=a(n — a — 2). By performing some elementary 
calculations, we obtain 



n — a — b>2. 



(66) 



a{n — a—2)> ^ — \/«— 1, 



(74) 



By using the definition oi LL , we infer H =G — Vi,-\Vi, + UaVb- 
By using the definition of di, we only need to show 



I W{G) - W{H)\ > \R{G) - R(H)\. 



(67) 



a^-in-2)a+ ^ -Vn-l<0 



(75) 



Obser\'e that V{G)= V{H)= V. We will discuss the dilference 
of the distances between two vertices in G and H. Let x and y be 
two vertices of G. \£ x,yeV\{vi,}, then we have di]{x,y) = dG{x,y). 
Now we suppose x=vi,. V y ^ V{Pi)[JV{P2), then 



for 1 < a < n — 3 and each value of «. Therefore, from (63), we infer 
I W(G)- WiH)\ > \R(G)-RiH)\. The proof is complete. □ 

This theorem can be used to compare the values of the distance 
measure by using trees. Let T„ be the set of trees with n vertices 
and 
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r° = {r° : T°eT„ and each pendent path of r°has length one}. 

(76) 



I(P)= - ^Pi^OgPi. 



(78) 



Observe that for every TeT„\T,„ there must be a tree T^eT„ 
such that T can be obtained from 7"" by repeatedly transferring 
pendent vertices. Therefore, we obtain the following corollary. 

CoroUary 1. Let TeT„\T^^, there exists a tree T^eT^ such that 
dw(T,T°)>dR(T,T°). 

Actually, numerical experiments show that for any two trees 
T,T'eT„, the inequality dw(T,T')>dR(TJ') holds. We state the 
result as a conjecture. 

Conjecture 1. Let T and T he any two trees with n vertices. Then 



We denote by djp the graph distance measure based on I(p). 

In the following, we infer an upper bound for dip{G.,H). 

Theorem 15. Let G and H be two graphs with the same vertex set. 
Denote hy p = (p\,P2, ■ ■ ■ ,Pn) and p' = (p\ ,p\, . . . ,p'^ be the probability 
vectors of G and H, respectively. Lf pj<p'i for each i, then we irifer 



dip{G,H)<\-e 



(79) 



dfv(T,T')>ddT,T') 



(77) 



holds. 

As an example, we consider (all) 23 trees with 8 vertices and 
calculate all possible values ot dn'(T,T') (blue) and dg{T,T') (red) 
as shown in Figure 1. From Figure 1, we observe that 
dw(T ,T')>dg{T ,T') holds for each pair of trees T and T' . 



where ^ = ^ log (1 + - ) + log (pi + 1 ) 
1 = 1 ^ P' 
Proof. Since pi < p\ for each /, then we obtain /), </>'■+ 1 and 
\ogPi< log(p;+ 1). Then we have 



p,iogp,<(p;+i)iog(/);+i) 



(80) 



Graph Distance Measures Based on Graph Entropy 

In this section, we consider graph distance measures which are 
based on graph entropy and other topological indices for some 
classes of graphs. 

In order to start, we reproduce the definition of Shannons 
entropy [43]. Let p = (p\,p2, ■ ■ ■ ,Pn) be a probability vector, 
namely, 0</7,<l and Pi=^. The Shannons entropy of^ 

has been defined by 



=;;;.iogo?;+i)+iog(p;.+i) 



(81) 



--p'M[p]{\^-^)] + \og{p\ + \) (82) 



--p', \ogp\ log ( 1 + - ) + log (p; + 1). (83) 
Pi 




100 150 

Pairs 

Figure 1. All the values of dw(T,T') (blue) and dR(T,T') (red). The 
Y-axis denotes the values of the distance measure and the X-axis 
denotes the graph pairs. 
doi:1 0.1 371/journal.pone.0094985.g001 



Therefore, we get the inequality. 



i{p)=-Y.Pi\ogpi>i(ii')- 

i=l 

(p'i log (1 + -i^ ) + log + 1) ) = Lip') -A, 
1=1 V Pi 



i.e., I(p') — I(p)<A. Hence, 



aip' 



(G,H)=\-e' 



< 1 — e u2 



(84) 



(85) 



The desired inequality holds. □ 

In [25], Dehmer and Mowshowitz generalized the definition of 
graph entropy by using information functionals. Let G = (V ,E) be 
a connected graph. For a vertex v,eF, we define 



P(vd 



E!='i/(vy)' 



(86) 
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where / represents an arbitrary information functional. By 
substituting p{vi) to (78), we have 



\y\ 



\v\ 



If(G) = log ( ^/(v,) ) - ^ ^^log/(v,). (87) 

7=1 



where ^e{£' ,£). Tlius, 



(94) 



We denote by djf the graph distance measure based on If. 

Relations between d£{G,H) and djg(G,H) 

Denote by /ti,/t2, • ■ • the eigenvalues of a graph G. By setting 
/(v,) = |/^,,| in (87), we obtain a new expression of the graph 
entropy namely 



(Ig(G)-Ig{H))'- 



(95) 



(96) 



Ig(G)= logf ^ |/,| ) - ^„\,, Y. I^-I log 



|^-|. (88) 



(£-£')^. 1 



1 ^ £-£'. ^'°^\t) I ('i„,('/^r 



:l_e ff^ (fln2)^.g „2 C 



(97) 



Recall that the energy of G is defined as S(G) = 
Then we infer 



Ig{G) = log{£(G)) - l^^'l log l^-'l 



(89) 



From the definition of Ig(G), it is interesting to investigate the 
relation between the graph distance measures and djg. 

Theorem 16. Let G and H he two graphs of order n with 
£(G)>£(H). Denote hy M,A2, ■ . ■ ,/?,„ and /l'i,A2, ■ ■ ■ the eigenvalues 
of G and H, respectively. Let /' = maxi <,<„{|/,'-|} and 
A= mini<,<„{|/,|}. Then we get 



\n{\-d,g(G,H))> 

\n{\-de{G,H)) 2(Iog|)V-ln(l-4(G,i/)) (log^)^(90) 



(C In 2)2 



fTClii2 



where ^e(£(H),£(G)) is a constant. 

Proof. Let G and H be two graphs of order n. Let £ = £(G) and 
£' = £{H) with £>£'. Then we get 



Ig(G)-Ig(H) = 
(\og{£)-\Y.\X,\\og\l. 



1 \(91) 

iog(o-^Ei^;iiogi^-;r 



= (log(£0-log(f'))+ ( 4E \^'>\ logl^;-l - ?E l^'l logl^'-| )(92) 



<(log(f)-log(f'))+(^EK|-^EW ) (93) 



1 -(l-4(G,7/))(?^^^-.-i^V-'"n-%(c,//,).^-^ 



\-dig(G,H)> 



(99) 



(l-4(G,//))«ta2)2-e"?fE2V-l"(>-%(G.«)), 



Taking logarithm for the two sides of the above inequality, we 
have 




100 150 

Pairs 



250 



Figure 2. Values of ds{T,T') (red) and d,g(TJ') (blue). The Y-axis 
denotes the values of the distance measure and the X-axis denotes the 
graph pairs. 

doi:10.1371/journal.pone.0094985.g002 
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\Qil-digiG,Hy)> 

\nil-de(G,H)) 2(log|)v/-ln(l-J,(G,ff)) (log^im 



In 2)2 



7=1 



7=1 



(105) 



The required inequality holds. □ 

Actually, numerical experiments show that for any two distinct 
trees T,T'eT„, dt:(T ,T')>d,g(T ,T') holds. See Figure 2 as an 
example, in which we consider (all) 23 trees with 8 vertices and 
calculate all possible values of ds{T,T') (red) and dig{T,T') (blue). 
We state this observation as a conjecture. 

Conjecture 2. Let T and T' be any two distinct trees with n vertices. 
Then 



= Ef ) = ^EW(^-W) (106) 



!=1 



!=1 



,A('-'-|:itf)=>- 



2m 



(107) 



de(T,T')>dig{T,T') 



(101) 



holds. 

Using a similar proof method of Theorem 16, we can obtain a 
generalization for the distance measure based on If (see Eq. (87)). 
Let / be an arbitrary information functional and 
f{G) = ^^,_]/(i'i) be a topological index. 

Theorem 17. Let G and H be two graphs of order n with 
f{G)>f{H). Let A'= maxi<i<„{/(v,) : v,-6G} and 
k= niini<y<„{/'(v,) : v/ei/}. Then we haoe 



The last equality holds since )?i=2m. By the following 

theorem, we study the relation between d^ and djgi . 

Theorem 18. Let Q be a class of graphs with n vertices and m edges. 
For two graphs G,HeQ, let £ = £{G) and £' = £{H). Then we get 



d£{G,H)>dig^{G,H) 



and 



de(G,H)<dig^(G,H)+[l 



I6m^\{£-£'f , 



(108) 



e", (109) 



ln(l -dif{G,H))> 



\n{\-df{G,H)) 2(log^)V-ln(l-4,(G,j/)) (log^(102) 



(f; In 2)2 



.1 l^?"^ 1^ ■ 
where ae( 1 — — - A) ts a constant. 

£^£<^ 

Proof. Let G and H be two graphs with n vertices and m edges. 
Without loss of generality, we suppose £>£' . 
To show the first inequality, it suffices to prove 



where r]e(f{H)f{G)) is a constant. 

Dehmer and Mowshowitz [44] introduced a new class of 
measures (called here generalized measures) that derive from 
functions such as those defined by Renyis entropy and Daroczys 
entropy. Let G be a graph of order n. Then 



\£-£'\>\igm-igim- 

Then from (107), we derive 



(110) 



2w 2m lm{£^-£'^) ,,,,, 

igm-igm=-^ + ja= \i^a ■ (111) 



/(V,) 



fctE;=i/(v,) 



e;=i/(v7) 



(103) 



If we let /(Vi) = |'iii, then we can obtain the new generalized 
entropy based on eigenvalues. We denote the entropy by 



hm : = £ 



(104) 



For a given graph G = {y,E) with n vertices, denote by 
,l„ the eigenvalues of G. By substituting £= ^"^j 1^1 
into equality (104), we have 



If we want to prove 



£-£'> 



2m{£^-£'^) 



we only need to show 



2m(£+£')<£^£'^. 



(112) 



(113) 



From a well-known bound of energy £' >2^Jm, we have 
£'^>2m and £^>{£-^£''). Therefore, de(G,H)>dig,(G,H) 
holds. 

Now we show the second inequality. From (1 1 1), we have 



\£-£'\ - \igm-igim =£-£'- ^'"^/"^ (114) 



PLOS ONE I www.plosone.org 



9 



April 2014 | Volume 9 | Issue 4 | e94985 



Interrelations of Graph Distance Measures 



-{£-£'){l- 



2m(£ + £') 



(115) max{ , : 



v,-eG} and 1= min { 



, ^= : vjeH}. 

l<J<n ^diVj) (-123) 



<(£-£'){ 1 



-(£-£'){ 1 



2w2v 



4w7 



(116) 



(117) 



ln(l-^?,y_,^2(^"^)^ 

ln(l-c/o^(G,j/)) 2(log^)^-ln(l-^o^(G,fl)) (log|)^(124) 



(J7ln2)^ 



Therefore, we have 



\Igi{G)-Igi{H)\> {£-£') 



4m 



(££'f^' 



From the definition of the distance measure, by some 
elementary calculations, we finally infer 



(/gl(G)-/gl(fl))^ 



(119) 



rie(f{H)/(G)) is a constant. 
For fc= 1, we get 



7/i(G)= log(2w)- ^J^^^^^log^. (125) 



Furthermore, by the definition of Ifk(G), for two graphs with 
the same degree sequences, we obtain that Ifk(G) = Ifk{H). 
Therefore, we get the following result. 

Theorem 20. Let Q be a class of graphs with the same degree 
sequences and I is an arbitrary topological index. Then for two graphs 
G,HeQ, we infer 



di(G,H) > doj,iG,H) = d,f^ {G,H) = 0. 



(126) 



(£-£'9. \6nP- 
? (££')^ . 



(£-£'f- 



= 1 



/^2 



16w^ \ {£-£')■ 



-e , 



(120) 



(121) 



where a6(l — ^3^'^/ ^ ^ constant. 
The proof is complete. □ 

Relations between di(G,H) and dif(G,H) 

Let G = (V,E) be a connected graph with n vertices, m edges 
and degree sequence {d\,d2, ■ . ■ ,d„), where di = d{vi) for \<i<n. 
By setting/(v,) = <^f in (87), we can obtain the new entropy based 
on degree powers, denoted by Ifk{G) 



/A(G)= logj^^rff j - ^^tld^^ogdl 



(122) 



For k= — 1/2, the expression y~]._^/(v,)= — Ff is just 

the zeroth-order Randic index ^R{G). Then by using Theorem 1 7, 
we obtain the following result. 

Theorem 19. Let G and H be two graphs of order n with 



^R(G)>''R(H). Let 



By using the similar proof method applied in Theorem 14, we 
obtain a weaker result. 

Theorem 2 1 . Let T = { V,E) be a tree with n vertices. Denote by P\ 
and P2 two pendent paths attaching to the same vertex such that 
1^1 1 > |i'2| ^ 1. Denote by T' the tree obtained by transferring the pendent 
vertex of P2 to the pendent vertex of P\ . Then we have 

dw{T,T')>d,f^{T,T') and dR{T,T')>dif^(T,T')- (127) 

Proof. Let T = (V,E^ be a tree with n vertices. Suppose 
P\=uu\U2 ■ . - Ua and P2 = wvi V2 • • • v/, with a>h>\. Denote by 
X the degree of M, i.e., x = d(u). Since Pi and P2 are two pendent 
paths attaciiing to the same vertex, then we have n — a — b>2. By 
using the definition of T', we have T' = T — Vb^iVb + UaVt,. By 
using the definition of dj, we only need to show 



mT)- W(r)\>\Lf(T)^Lfi(T')\ and 
\R{T)-R{T')\>\Lf,{T)-lfy{T')\. 



(128) 



For a tree T with n vertices, we get Lfi(S„)<Lfi(T)<Lfi(P„). 
By performing elementary calculations, we get 

\Lfi(T)-LfiT')\ <Lf{P„)-Lf,{S„)= - (129) 

1 n — Z 



Observe that V{T) = V(T')= V. We first discuss the difference 
of the distances between two vertices in T and T'. Let x and y be 
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two vertices of T. If x,yeV\{vb}, then we have dT'{x,y) = dT(x,y). 
Now we suppose x = vi,. If y t ^(Pi)^ ^i-Pi), then 
dTi{x,y) — dTix,y) = (a + I) — b = a — b + i . Observe that 



iT(vb,u)+ ^fi?7-(v,)+ ^t,?7-("/) = 



6-1 



(130) 



dji{yb,u)+ ^c?j-((v,)+ y^^dji (uj). 

i=l 7=1 



Therefore, we get \ W{T)-W(T')\=(n-a-b-\){a-b+\). 
For h>2, it is easy to verify that If\{T) = If\{T'), i.e., 
|//i(2^)-//i(m=0. Then, 



\R{T)-R{T')\ 
1 



1 1 

2 ' v'2(x-l) ^/2 



1 

- + 



We can show that for x>A and «> 13, 

11 11 

2 y'2(x-l) 
1 



2(«-l) 



[x log x — {x—\) log (x — 1) — 2] , 



(137) 



|n/(r)- n/(r')|>|//i(r)-//i(r)| and 

\R(T)-R(T')\>\IMT)-IMT')\. 



(131) 



In the following, we suppose h=l. 

We obtain I <a<n -3 a^nd \W(T)- WiT')\=a(n- a- 2). By 
performing elementary calculations, we get 



i.e., \R(T)-R{T')\>\IfiiT)-Ifi(T')\ for «>13. For smaller n, 
we verify this inequality directly. Now suppose x = 3, then there is 
only one vertex in S whose degree is at most « — 4. Therefore by 
using (133) and (136), we get 



mT)-IMT')\ = —1— (3 log 3 - 4) (1 38) 

2(w-l) 



a{n — a — 2)> 



log(« — 1) « — 3 
2 



(132) 



for l<fl<« — 3 and each value of n. Therefore, 
I WiT) - W{T')\ >\IM T)-IJ\(T')\. 

To prove the other inequality, we need more detailed 
discussion. By using the definition of graph entropy, we get 



and 



\R{T)-R{T')\>1- 



1 1 



1 1 



\/3 V6 \V2 V^J Vn^ 



(139) 



|//i(7^)-//i(r')l = ^-^T[A-logx-(.x-l)log(x-l)-2]. 

(133) 



Let 5" be the set of the neighbors of vertex u, which does not 
contain Mi and Vi. Denote by dj the degree of a vertex in 5, where 
7 = 1,2, ... ,x — 3. If a= 1, then 



\R{T)-R{T') 



1 



V2 ^2{x-\) ^jir'.Jdi 



(134) 



;=i V"/ 



By performing some calculations, we can show that for > 3 
and M > 9, 



1 1 2 

71 sJl(x-\)~^ 

1 



(135) 



2(«-l) 



[xlogx-(x-l)log(x-l)-2], 



25000 



20000 



15000 



* Pairs 



10000 



5000 




i.e . , I r) - r') I > I //i ( r) - //i ( r' ) I for « > 9. For smaller w, we 
verify this inequality directly. If fl>2, then we have 



Figure 3. Values of d,f,(T,T') (blue) and dgiTJ') (red). The Y-axis 
denotes the values of the distance measure and the X-axis denotes the 
graph pairs. 

doi:10.1371/journal.pone.0094985.g003 
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We can verify 

--^ ^+f-^--'^^^^> ^- (3 log3-4)(140) 



for each n, i.e., \RiT)- R(T')\>\IMT)- Ifi(T')\. □ 

From Theorem 14 and 21, we obtain the following corollary. 
Corollary 2. Let T = {V,E) be a tree with n vertices. Denote by P\ 

and Pj the two pendent paths attaching to the same vertex such that 

I -Pi I > I -P2 1 ^ 1 ■ Denote by T' the tree obtained by transferring the pendent 

vertex of P2 to the pendent vertex of Pi . Then we have 

dw{T,T')>dR{T,T')>d,f^{T,T'). (141) 

Therefore, we obtain a similar result to comparing the values of 
distance measures of trees. 

Corollary 3. Let TeT i,\T^, there exists a tree T^eT^^ such that 
dw(T,T°)>dR(T,T')>d,f^{T,T°). 

Actually, our numerical results (see section 'Numerical Results') 
show that for any two trees T,T'eT„, the following inequality may 
hold. 

Conjecture 3. Let T and T' be any two trees with n vertices. Then 
dw{T,T') > dR{T, T') > d,f^ (T, T') (142) 



holds. 

By way of example, we consider all 23 trees of 8 vertices and 
calculate all possible values of djf^{T,T') (blue) and dg{T,T') (red), 
respectively, as shown in Figure 3. From Figure 3, we observe that 



djf^{T,T')<ddT,T') (143) 
holds for each pair of trees T and T' . 

Numerical Results 

In this section, we interpret the numerical results. First, we 
consider all trees with 8 vertices. The number of trees is 23 and the 
number of pairs is 253 (see [45]). From the curves shown by 
Figure 1, we see that both measures dw{T,T') (blue) and dii(T,T') 
(red) satisfy the inequality Eq. (77). From the curves shown by 
Figure 2, we observe that both measures d^(T,T') (red) and 
dig(T,T') (blue) satisfy the inequality Eq. (101). From the curves 
shown by Figure 3, we also learn that both measures dif^(T,T') 
(blue) and dR{T,T') (red) fulfill the inequality Eq. (143). By using 
this method, several other inequalities could be generated and 
verified graphically. 

Figures 4 and 5 show the numerical results by using the graph 
distance measures based on graph energy £, the Wiener index W 
and the Randic index R, respectively. We consider all trees with 
1 1 vertices. The number of trees is 235 and the number of pairs is 
27495 (see [45]). By Figure 4, we depict the distributions of the 
ranked distance values, that is, d^ (red), dw (blue), and dR (yellow). 
First and foremost, we see that the measured values of all three 
measures cover the entire interval [0,1]. This indicates that the 
measures are generally useful as they are well defined. By 



% 




Figure 4. Distributions of the raniced values of tKie distance 
measure (red), dw (blue), dR (yellow). The X-axis denotes the 
values of the distance measure. The Y-axis denotes the number of 
graph pairs. 

doi:10.1371/journal.pone.0094985.g004 

considering dyy, we observe that only a relatively little number 
of pairs have a measured value < 0.8. But a large number of pairs 
possess distance values > 0.8. When considering dR, the situation 
is reverse. The distance values of d^ seem to slightiy increase with 
some up- and downturns. However, Figure 4 does not comment 
on the ability of the graph distance measures to classify graphs 

1 n 



d 




Pairs 



Figure 5. The X-axis denotes the values of the distance 
measures d^ (red), dw (blue), dg (yellow). The Y-axis represents 
the percentage rate of all graphs studied. 
doi:10.1371/journal.pone.0094985.g005 
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efficiently. This needs to be examined in the future and would far 
beyond the scope of this paper. 

Furthermore, we have computed the cumulative distributions 
by using the measures dc (red), dw (blue), dR (yellow), respectively, 
as shown in Figure 5. In general, the computation of the 
cumulative distribution may serve as a preprocessing step when 
analyzing graphs structurally. In fact, we see how many percent of 
the 235 graphs have a distance value which is less or equal d. Also, 
Figure 5 shows that the value distributions are quite different. 
From Figure 5, we see that the curve for dw strongly differs from 
d£ and dn. When considering d^, we also observe that about 80% 
of the 235 trees have a distance value approximately < 0.5. That 
means most of the trees are quite dissimilar according to dR. For 
d]Y, the situation is absolutely reverse. Here 80% of the trees have 
a distance value approximately < 0.98. Finally evaluating the 
graph distance measure dg on these trees reveals that about 80% 
of the trees possess a distance value approximately < 0.85. In 
summary, we conclude from Figure 5 that all three measures 
capture the distance between the graphs quite differendy. But 
nevertheless, this does not imply that the quality of one measure 
may be worse than another. Again, an important issue of quality is 
fulfilled as the measures turned out to be well defined, see Figure 4. 
Another crucial issue would be evaluating the classification ability 
which is future work. 

Summary and Conclusion 

In this paper, we have studied interrelations of graph distance 
measures which are based on distinct topological indices. In order 

to do so, we employed the Wiener index, the Randic index, the 
zeroth-order Randic index, the graph energy, and certain graph 
entropies [25]. In particular, we have obtained inequalities 
involving the novel graph distance measures. Evidenced by a 
numerical analysis we also found three conjectures dealing with 
relations between the distance measures on trees. 

From Theorem 1 , we see that the star graph and the path graph 
maximize dj among all trees with a given number of vertices, for 
any topological index we considered here. Actually, this also holds 
for some other topological indices, such as the Hosoya index 
[46,47], the Merrifield-Simmons index [48,49,47], the Estrada 
index [50,51,52], and the Szeged index [53,54]. All other 
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theorems we have proved in this paper shed light on the problem 
of proving interrelations of the measures. We believe that such 
statements help to understand the measures more thoroughly and, 
finally, they are useful to establish new applications employing 
quantitative graph theory [55]. We emphasize that the star graph 
and the path graph are apparently the two most dissimilar trees 
among all trees. Similar observations can also be obtained for 
unicyclic graphs or bicyclic graphs. Therefore, in the future, we 
would like to explore which classes of graphs have this property, 
i.e., identifying graphs (such as the path graph and the star graph) 
which maximize or minimize dj. 

Another direction for future work is to compare the values of 
di{G,G') where G,G' are general graphs. For example, we could 
assume that G and G' are obtained by only one graph edit 
operation, i.e., GED(G,G') = 1, see [15]. Then, all the graph which 
fulfill this equation are (by definition) similar. This construction 
could help to study the sensitivity of the measures thoroughly. 
Note that similar properties of topological indices have already 
been investigated, see [56]. As a conclusive remark, we mention 
that dynamics models on spatial graphs have been studied by Perc 
and Wang and other researchers, see [57,58]. It would be 
interesting to study the distance measures in this mathematical 
framework as well. 
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